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ESTIMATION PROCEDURES TO MEASURE AND MONITOR FAILURE 
RATES OF COMPONENTS DURING THERMAL-VACUUM TESTING 

R. E. Williams 
and 

Raymond Kruger 
ABSTRACT 

This report describes estimation procedures for measuring component failure rates, 
for comparing the failure rates of two different groups of components, and for formu- 
lating confidence intervals for testing hypotheses (based on failure rates) that the two 
groups perform similarly or differently. Appendix A contains an example of an analy- 
sis in which these methods are applied to investigate the characteristics of two groups 
of spacecraft components. 

The estimation procedures are adaptable to system level testing and to monitoring 
failure characteristics in orbit. 
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ESTIMATION PROCEDURES TO MEASURE AND MONITOR FAILURE 
RATES OF COMPONENTS DURING THERMAL-VACUUM TESTING 

INTRODUCTION 

The perfo rm ance of spacecraft components in test affects the expenditure of resources. This 
pert o. man ce may also be indicative of later performance. If one could define the failure charac- 
teristics of this performance and could compare the performance between two groups, one a test 
group and the other a control or norm, then one could better evaluate whether to expend more 
or less resources in developing these components and also predict the ion nance of these com- 

ponents during later phases of the program. 

It was the purpose of this study to develop methods for both defining the performance of a 
group of components and for quantifying the similarity of this performance to that of another 
group of components. 
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SECTION I. ESTIMATION PROCEDURE TO MEASURE FAILURE RATES 


The therm*!- vacuum test process subjects individual components or groups of components 
to temperature and vacuum cond.tions similar to what they would encounter in space. (A com- 
ponent, as referred to in this report, is an assemblage of parts making up a basic unit or “black 
box" such as a transmitter or a nower supply that performs a specific function in a system.) The 
duration of the test may vary. 

During a test, a failure may occur that completely disables the unit. In this case the test is 
generally stopped, and repain are made before the test is continued. Sometimes a failure occurs 
that does not totally disable the component. One may discover this failure at any time between 
its occurrence and the end of the test. The failure is then repaired, and the test manager may 
decide to retest. In some cases, a test is terminated with no failures. Therefore, we are looking 
at a time process that has failures (deaths), terminations (losses), and late entries (where repain 
are made and the item is retested). (It should be noted that failure discovery rates rather than 
failure rates are actually monitored.) 

To model the process, we use the Kaplan-Meier Product Limit (PL) estimation procedure as 
defined and described in (1), (2), and (3) and apply it to data as collected in Kruger-Norris (4). 

The procedure, used in the Kruger-Norris thermal-vacuum test optimization model (4), norm- 
alizes the data and uses certain criteria for goodness of fit to estimate the parameters of a modi- 
fied Duane model (5). From these parameters, one can project the failure rate into system level 
testing and then on to orbital operations. These projections are used to calculate a parameter 
called “availability” in reference (4) which, in turn, is used in the determination of optimal test 
conditions. 

The PL estimation technique provides a basis for defining failure rates and also allows for 
statistical comparisons as shown later. In order to develop PL estimates, we define the probability 


of survival. P(t). as 



P(t) = P(T > t). 


» 

( 1 ) 

T denotes the event that the component or collection of components has survived, and F(T > t) 
means the probability that the component has survived beyond time t. 

The PL estimate is based on procedures given in Kaplan- Meier ( 1 ) and r«tuoai£S F(t). The rela- 
tionship between t h PL estimate and the failure distribution for component life is given by 

F(t) * 1 - P(t). (2) 

In comp, jot level testing, we have a distribution based on component count which is de- 
fined by dividing the number of failures that have occurred up until time t by the total number 
of components, N, remaining on test. This ratio is the cumulative frequency of component fail- 
ures over time and will be called F*(t). Since the PL estimation procedure incorporates the same 
type of life information about component level testing as does F*(t). 

F*(t) = F(t) * 1 - P(t). (3) 

To estimate F*(t), we first estimate P(t). The estimate of F*(t), F*(t), is related to the 
estimate of P(t), P(t). by 

F*(t) = ! - P(t). (4) 

A A 

The PL estimate is a cons istent estimate of P(t) and thus a consistent estimate of F*(t). 

In order to calculate the PL estimate, we do the following: 

(a) The age (or time) scale is divided into suitably chosen intervals. (0. ), (n l . <jj : . 

fi } ). . . The intervals may be chosen as time intervals where the /ij’s correspond to the 

failure times or may be intervals of constant length. For notation in this report, the 
time from Uj-i to ^ is designated as Mj_j , j - 

A different estimation procedure exists for each case. By choosing time intervals 
of constant length, one may graphically monitor and compare two different processes 
oased on two different methods of normalizing the data. 
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(b) For each interval (jij.j p. one estimates the proportion of items alive just after Mj.i and 
that survive beyond ftj. 

(c) if t is a division point (it may be introduced specifically, if necessary) the proportion, 
P(t), of the population surviving beyond t is estimated by the product of the estimated 
proportion surviving each interval, pj. ( j for all intervals prior to t. 

(d) If there are ties between the failure time and time that a test ends, the time 'hat a test 
ends is taken to occur immediately after the corresponding failure time. If a failure 
and test time fall on the endpoint of an interval as defined in (a), then the test time 
goes into the following interval (to the right) as a loss, and the failure time stays in 
the interval in which it occurred. 

(e) If a repair occurs and vhe test then continues, die loss enters the process as a negative 
quantity. Let y be defined as the loss measurement. This loss occurs when items go 
off test and when items come on test. Suppose one item appeared on test during a 
time interval and two items went off test during the corresponding time interval. The 
overall loss is y * 2 - 1 = I. If no items went off test during this interval, then y * -I. 


Under the restrictions given in (a) through (e), the PL estimate is 

P(t) * IT pj., j(t). 

J-l 


(5) 


To account for the variations previously outlined and to formulate the above process in 
terms of an iterative time process, we let 

rij_j = Number of items on test at time #ij_! , 
dj.j j = Number of items failed during the interval Mj-i.j. 

yj_i j = Number of items whose test terminates ( truncates > or are put back on test (late en- 
tnes)-entered as a positive or negative number respectively -during the interval Pj-i.j- 
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From the above definitions, we have 

B . n j-i “ y j-t j “ 

Pj-i.j * -* 1 ~ . (6) 

"j-i - yj-i.j 

To illustrate the use of the above formulas, we consider the following examples: 

Example 1 : Intervals Established by Times of Failures 

Example Parameters: 

Eight items on test 
Failure times: 8,31,51.92 
Truncation times: 10. 27, 70, 121 
Entry time: 80 

The entry time corresponds to one of the items being repaired and then retested. 

We illustrate the time line schematically in Figure 1 with a failure time indicated by F, a 
truncation time indicated by T. and an entry time indicated by E. 



Figure 1 . Time Line of Events for Examples I and II 


The first calculation scheme uses intervals determined by the times of failures: thus, m o ~ 0* 
= 8 . (I, —31. M 3 =51. and n 4 = 92. 

The calculation scheme is given below in Figure 2. 
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-8 n, » no - y 0 .i - do.i n 2 * n, - y 1>2 - d 1>2 n 3 - n 2 - y 2 3 - d 2 3 

, - 1 -8-0-1 -7-2-1 -4-0-1 

y 0 j - 0 -7 -4 -3 

d 1>2 - 1 d 2 3 » 1 d 3 4 » 1 

y, 2 - 2 y 2f3 ■ 0 y 3 4 - 1 - \ - 0 


a "0 - y 0 .l “ d<U 1ft n l - Vi. 2 " d l,2 


n 2 - y 7 * - d 


2.3 ~ 


n 3 - yj.4 - d 3.4 


"o ■ yo.i 

8 - 0-1 


"i -yi.2 
7-2-1 


"2 - y2.3 
4-0-1 


n 3 - y3.4 
3-0-1 


8-0 


4-0 


3-0 


= 0.875 


0.800 


0.750 


0.667 


Figure 2. Calculation Scheme for Example I 


Using equation (S). one can find the following PL’s: 

P(8) = — = 0.875 
8 


Ur 0700 

* 5l) -& )■ (?)■ $= or - 5 


/N / M 

P(92) = - 
V8 j 


[I ) (!) ■ 0 350 


Using equation (4) we may then graph the probability of failure, F*(t), as a function of 


time as shown below in Figure 3. 
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0 8 31 51 82 

tima, t 


Figure 3. Probability of Failure vs. Time. Example I 

To illustrate how the process changes when we change the time intervals to intervals of con- 
sistent length of thirty we have the following example: 

Example 11 . Intervals Established at Fixed Times 
Mo ® 0, Mj = 30. * 60, m 3 = 90. = 1 -0 

The calculation scheme for these intervals is given in Figure 4. 

8 31 51 80 92 

F F F E F 

I i I 1 


30 
27 .. 


30 80 ' 90 120 

Mi M 2 70 M 3 M« 

n l = n 0 “ Vo 1 ” ^0 1 n 2 = n l — y 1 2 — ^1,2 n 3 = n 2 ~ V:.3 “ ^2.3 

=8-2-1 =5-0-2 ' =3-0-0 

= 5 =3 =3 

dj 2 = - d 2 3 = 0 dj 4 = 1 

y\.2 = 0 \y2.1 = 1-1 y 3,4 = 0 


n i • y i .2 - d i,2 
n l - Vl .2 


5-0-2 


n 2 - y 2 . 3 * 3 

n- : - y i 3 

3-0-0 


n 3 " V3.4 " J 3.4 I 


*3 " > 3.4 


3 - 0 - 


= 0.833 


= O.bOO 


1 000 


= 0 bt>‘ 


Figure 4. Calculation Scheme lor Example II 


Again. ?*(t) may be calculated from equation (4); a plot of P*(t) is shown in Figure 5. 



0 30 80 90 120 


tima, t 

Figure 5. Probability of Failure vs. Time, Example II 

A 

It is clear that one obtains different plots of F*(t) depending on the selection procedure for 
the time intervals. If the number of failure times is small, then one should use the failure times 
to determine the intervals With this procedure, it is still possible to monitor the piocess 

through hypothesis testing, so nothing is lost in terns of one’s ability to monitor. 

Example 111 : A Ranking Approach 

If one uses the individual times o? failures, individual times of losses, pools the time informa- 
tion. and calculates the rank of the failure time in the combmed sample, then (assuming there are 
.no late entries) p, may be estimated by 

~ N - R(t ) 

p. = L— ( 7 ) 

N - R(tj) + 1 

where R< tj ) is the rank of the j lh failure t me the combined sample We illustrate this tech- 
nique by the following example 
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In the previous example, with the entry time 80 omitted, we have in Figure 6, 


3 31 51 92 

■ F F F F 

T T T T 

t: 0 10 27 70 121 

Figure 6. Time Line of Events, Example III 

with (8, 31, 54, 92) the failure times. Consequently, the ranks of the failure tim's are 

R(8) = 1 , R(3 1 ) = 4, R(51) = 5, R(92) = 7. 

The ranks and formula (7) give 

Pj = 7/8, P 2 = 4/5. P 3 = 3/4, P 4 = 1/2. 

AAA A 

Pj , P 2 , Pj correspond to the first three Pj’s in the original example before a late ent ■ time 
takes place. If late entries occur, we readjust formula (7) to allow for this. The reat. usted form- 
ula is thus 

^ {N + Iy(t)} - R(tj) 

Pj = ~r r 1 1 — (8 > 

J {N + Sy(t)} - R<tj) + 1 

where the ranking procedure is carried out in the same manner, but the sample size N is adjusted 
if and only if and where a late entry is made. 

In the previous example, a late entry is made at t = 80. Thus, we adjust N by adding y (80) 
= 1 (one late ^ntry) to N for the calculation that involves the next rank and each rank thereafter. 
This yields 
P, = '’'8 

P ; = 4,5 

P 3 = 3/4 

^ < N + I > - 7 8+1-7 2 

p 4 = = = _ . 

(\ + i)- 7 + l 8+l-"+t 3 

We note that these estimates are now the sam u as the original estimates in Example I. 
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SECTION 2. CONSTRUCTION OF CONFIDENCE INTERVALS 

With the elation Ptocedu.es dearly defined ^ ^ ^ ^ 

of confidence interval and the nuesfion of tnon.tonng a process. To cons, rue, confidence 
intervals for F*(t), we use the fact that 

$t) - P(t) 


ra,i ° " " 0nM “ y *“ ■“ * — ° F «"> and a standard delation of one- 
) " 9Ven ^ ' qUa,i0nS <5) " by ”** I-*- (7, or (8). The vanance, V(t), „ s ,e„ aa 


( 9 ) 


V(t) = £ 

j'l n j-i CPj-l.j) 


( 10 ) 


Thus, front <9, and ,10, the confidence interval for «„ (fite unhnown h„, real- product catenate, 
at any point t in time is 

«0 - Zn,t fto Vm Z Pit, <; Pit, + z an fit) -/Oft, (1 1 ) 

or 

P(t) lies between P(t) [l ± z a/2 y/Q(t)J 

Thisjillows one to calculate bounds around any step for F*(t). Note 9 *(t) = l - p (t) or 
1 - F*(t) = P(t). 


Since P(t) = 1 - F*(t) and ftt) = 1 - 9*(t), we have 

FI') = ' + (F * (t) - •> [' * Z W2 (F*(t) - l) ( , 

As an alternate derivation, using P L and P„ designate lower and upper bounds respectively, 
it can be shown that 


where 


and 


1 - Pu < F(t) < 1 - p L 

Pu ^ p ^t) [i + z a/2 y^] 

p L = P(t) [i - z an Vvm]. 


(13) 
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SECTION 3. COMPARISON OF TWO PROCESSES 
To compare the two groups, we let 

B = ? 2 /^i (14) 

A A 

where P 2 and Pj are constructed as previously shown. If 9 is equal to 1 , then the two processes 
are the same. Stated in terms of hypothesis testing, we have 

Hq: 0 = 1, > 1 

A /\ 

where it is understood that P 2 will represent the group that has the greater P. The distribution is 

A A 

(P 2 - 9?.) 

. , -A - /N . /A « . ~ N(0, 1 ). 

{P 2 2 v 2 + 0 2 Pj 2 V,} 1 ' 2 

This implies that 

Pr (O 5s I P 2 - 0 Pj I £ Z a/2 {P 2 2 V 2 + 0 2 Pj 2 Vj} ) “ l - a. (15) 
To find confidence bounds for 0, we consider the inequality inside the probability statement. 

We thus have 

P 2 2 - 2P 2 Pj 9 + 9 Z Pj 2 < Z a/2 2 {P 2 2 V 2 + 0 2 Pj 2 Vj} (16) 

and, representing the inequality as an equality, we have 

( P 1 " Z al 2 P i V® ~ (2P 2 Pj ) ^ + (P 2 2 - Z a/2 2 P 2 2 V 2 ) = 0. (17) 

Solving the quadratic equation in (17) yields 


9 = 


A A / A a A a A a a A a A A a a A a A 

_ ~ P 2 P i ±V4P 2 2 Pj 2 - 4(P j 2 -Z tt/2 2 Pj 2 Vj)(P 2 2 -Z a/2 2 P 2 2 V 2 ) 


A A A A a"™A 

- z«/ 3 a P, J V 


(18) 


Thus, the confidence intervals for 9 are 

A A — - 

:p 2 Pj -V 


A A A O 

->(p 2 - 7 2 P 2 V ) 

r l L a 12 r i v i ; 


A A 

2P,P. + >/ 

« £ -j - L— - 


YP 2 _ 7 2 P 2 V ) 

-'-M **al 2 r i i ; 


(19) 
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where 


! 


y/~ is the term under the square root symbol in equation (18) 
Pj is calculated from equation (5) for the first group 

f> 2 is calculated from equation (5) for the second group 

Vj is calculated from equation (10) for the first group 

V 2 is calculated from equation (10) for the second group 

Zq/j is the upper ot/2 value (e.g., a = 0.05, 1^2 ~ 1-96). 

An alternate way to write the equation is 


e = e 


1 ± {1 - (l - z 0(2 ! V|)(l - Z .,; 2 V.) 


1/2 


1-7 2 V 

1 ^a/2 V 1 


( 20 ) 


[f l l < $ < Ly and 1 < L l where L l is the lower limit and L y is the upper limit, then 
P 2 > P t at that particular point in time. 

Stated another way, if the confidence interval for 6 does not include one for a point in time, 
t, then P 2 (t) > Pj(t) and the two processes are different. 
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SECTION 4. CALCULATION OF RISK FOR INCORRECT SCREENING 




It is often necessary to make a decision as to whether the individual or groups of compo- 
nents have progressed through the test screen with failure rate characteristics similar to compo- 
nents that have passed through this test screen and have performed successfully in later phases. 

In order to do this, we utilize the concepts of (1) controlling the difficulty of a screen and (2) 
the risk involved when we do not let a group of components pass the screen. 

Having a difficult screen relates to the a measurement or the probability of a Type I error 
in statistical hypothesis testing. To make the screen more difficult, we decrease the confidence 
bands around the step function and consequently increase a. The larger or becomes, the tighter 
the screen. To assess the tightness of the screen, one usually asks a question such as, “If one 
makes the decision that the components passed the screen, and thus no corrective action is taken, 
what is the consequence of this action if trouble develops later in future testing or flight programs?” 
This assessment is usually made on weighted cost or subjective engineering judgment. In the case of 
space flight, it is thought best to have a tight screen, on the adage that “it is better to catch it and 
not have it happen, than it is to not catch it and have it happen.” We adopt this philosophy in 
this screening process. On the other hand, if we do not let a group of components pass the 
screen, we encounter other costs such as those arising from redesign and retest efforts. If one can 
estimate the costs involved with the consequences of the decision to let a group of components 
pass the screen or not pass the screen, then it is possible to look at expected cost tradeoffs. To 
quantify the concepts, we use the notions of Type I and Type II errors. 

We first take as a null hypothesis the belief that the two groups perform similarly. In the 
process, two incorrec* decisions can occur. If we say that the group fails, i.e., it is unlike the con- 
trol group when in fact it is similar, then we are making an incorrect decision. Call this an incor- 
rect decision, Dj , of Type I. 

On the other hand, if we say a group passes when in fact it is unlike the control group, we 
are again making an incorrect decision. Call this an incorrect decision, D 2] . of Type II. 
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To quantify the former idea, let 

a * Pr(D,.), 0 = Pr(D 2 .) (21) 

If we use the concept, “It is better to catch it and not have it happen, than it is to not 
catch it and have it happen,” then we want a to be large enough to guarantee a rigid screen. A 
corresponding calculation of 0 is made when one chooses to not pass the test group. If the costs 
of decision D t and D 2 are respectively C 2 and C 2 , then the expected costs for these decisions 
are 

E(D,) = a C x and E(D 2 ) = 0C 2 (22) 

To monitor the process of component level testing, we operate under the null hypothesis 
Hj,: 0 = 1 for the statistic 

P 2 - 8 P, - N(0, P 2 2 V 2 + 0 2 P, 2 Vj ) 

i.e., the statistic is normally distributed with a mean of zero and a standard deviation of (P 2 V 2 + 
Q 2 Pj 2 V, )**. 

The screening process may be viewed in the following manner. If the control group and the 

A A AAA 

test groups are the same, then Pj = P : . Since 6 - P 2 / P 2 , this implies that 0 = 1 when the groups 
are alike. 

If we construct confidence intervals on 9 from the test data and these confidence intervals 
do not cover 0 = 1, then we reject the null hypothesis, Hq : 0 = 1, and the group does not pass 
the control test. 

From a program to calculate product limit estimates for a control group and a test group. 

Aw Aw 

as shown in Appendix A, we obtained P 2 = 0.8759 and P t = 0.9482 with V 2 (the variance of 
P 2 ) = 0.001 10 and Vj (the variance of P, ) = 0.00028. 

Confidence intervals for 0 at a = 0.20 were (0.8803. 0.9680). Since these confidence inter- 
vals do not cover 0 = 1 . we reject H 0 : 0= 1. To calculate 0 , we do the following: 


(a) Calculate a confidence interval about 0 for H^,: 0 = 1, a = 0.20. This confidence in- 
terval corresponds to the Z acceptance region, -1.28 ^ Z ^ +1.28 where 


A A 

P 2 ~ 9 P 1 


{f 2 2 V 2 + 0 2 Pj 2 V t } M 


z = (i - e/d) {0 2 + (did) 2 v,} 1/2 . 

For a - 0.20 (Z = 1.28), we have the following inequality 

% * 9 p i 

-1.28 ^ ~ a > 77 , ^ 128 - 

{P 2 2 V 2 + 0 2 P 2 V } 1/2 


Letting 


C = {p 2 2 V 2 + d 2 Pj 2 Vj} 1/2 


and, solving for P 2 f ? l , we have 


1.28C 


1.28C 


< p 2 / Pl 


For the null hypothesis, Hj,: 0 = 1, equation (26) reduces to 

1.28C ^ 1.28C 

1 - -S— £ 0 — 1 + 


(b) Substitute for0 a value that depends on Z, C, and the new belief about 0. Since Z = 
(0 -0 N )/C, 

0 = ZC + 0 N (28 

A A 

where 0 N is a new estimate which equals P 2 /Pj • 

A 

Substituting for 0 in equation (27) and making a probability statement, we have 


[ 1.28C\ / 1.28C 

‘ 3 = Pr (' )- ZCt ^ - 1 + T“ 
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0 - Pr 


/ 1.28C\ 1 ^ / 1.28C \ 1 

[('•'" - x)c szs r*"* tv* J 


As an illustration of the calculation of 0, we have from our calculations shown in 
= 0.9482, V, * 0.00028 and 
£ 2 = 0.8759, V 2 = 0.00110. 

Then, from the definition of 0 N as in equation (28), 

0.8759 


* 


0.9482 


= 0.9238. 


As before, we let 


C = {(0.8759) 2 (0.00110) + (0.9482) 2 (0.00028)} 1/2 
= 0.0331 

and from equation (29), 

1 

.0331 


fr 1.28(0.0331)1 

* ■ *11° - °' 9238) ' J ol 

r ( , . 0 . 9238 + »■»«»>”» ] _lA 

L 0.9482 J 0.0331^ 


<>Z<> 


= Pr {0.9581 <, l < 3.6580} . 
From the definition of the right-hand term, 

• 3.6580 


0 = 


| /•i.bSi 

'J 2 * •'0.95SI 


e* 1/2 X dx = 0.169. 


(29) 

Appendix A 
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SECTI0N5. CONCLUSIONS 

Using the PL process, one can monitor failure characteristics during component and system 
level testing and also during the orbital mission. This process enables one to make confidence 
statements about the failure characteristic of a test group and to compare the characteristic with 
that of a control group. This can be done graphically with statistical hypothesis testing. 

One can calculate risks and have quantitative guidelines for making decisions as to whether 
to continue testing or not. 

Using the step function to represent F*(t), one can obtain regression estimates of a Duane 
type mode to project failure characteristics from component level testing to system level and on 
to the orbital mission. The process gives the same form of an estimation procedure at each stage 
of testing. 

The process converges in expected mean square to the theoretical distribution. This means 
that estimates of F*(t) based on this process will be a good representation of the failure 
characteristic. 
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APPENDIX A 

EXAMPLE OF ANALYSIS USING ACTUAL DATA 

The data used in this example come from two entirely different groups of spacecraft compo- 
nents. The first group, noted as record numbers 1 to 931, is taken at random from a group of 
tests conducted in the period 1966 to 1976. The second, records 941 to 1631, is data from a 
group of components for one spacecraft tested in 1976 to 1978. Ten record numbers are used 
to maintain the data for one test so 94 tests are included in the first group and 60 tests in the 
second. Any test may contain one or more components. 

Figure A-l is a simplified flow diagram for the computer program used to reduce the data. 
The program itself is written in a modified BASIC language for a Hewlett-Packard 983 1 A desk- 
top calculator. Block 1 of Figure A-l basically performs program housekeeping functions. 

Blocks 2, 3, and 4 select the data to be analyzed and perform functions necessary to put the 
data into a form compatible with the statistical analysis requirements. The value for Z entered 
in block 6 determines the stringency of the comparison between the two groups of data. The 
comparison is done in block 7; if the upper and lower limits for d do not include l.C as in equa- 
tion (19). then the hypothesis that the two groups are alike is rejected and the acceptance region 
for the alternate hypothesis is computed. 

Figure A-2 is a copy of the print-out of the data as adjusted and entered. The note at the 
top signifies that a 24 hour interval will be used in the analysis. The first case, as noted previ- 
ously, includes records 1 through 931. The number of components actually listed in the data is 
1 87. However, tests exist in the data wherein more than one failure per component exists. This 
condition is impossible in the Kaplan-Meier ( 1 ) approach and so the data was adjusted to in- 
crease the number of components so that at least one component was on test during each test 
period. The problem, in part, exists because of the uncertainty in designating a component: a 
complex item is normally designated as being made up of more than one component. However. 
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Figure A-i. Simplified Flow Diagram for Program to Reduce Data 
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TIME INCREMENT SET AT 24 

1st CASE! BEGIN RECORD 1 > END RECORD 931 

INITIAL COMPONENT COUNT: 187 * ADJUSTED COUNT* 198 


ROM 

HOUR 

CODE 

#C0MP 

ROM 

HOUR 

CODE 

#C0MP 

1 

109 

3 

4 

86 

232 

3 

2 

2 

98 

3 

1 

87 

26 

3 

15 

3 

48 

3 

2 

88 

26 

1 

1 

4 

182 

2 

0 

89 

33 

3 

1 

5 

296 

3 

4 

90 

57 

3 

2 

6 

192 

1 

1 

91 

60 

3 

1 

7 

212 

2 

-1 

92 

33 


0 

8 

212 

I 

1 

93 

126 

3 

1 

9 

230 

3 

4 

94 

26 

3 

1 

10 

149 

3 

1 

95 

26 

3 

1 

11 

34 

1 

1 

96 

53 

3 

1 

12 

162 

1 

1 

97 

54 

3 

1 

13 

231 

3 

2 

98 

135 

•5 

0 

14 

45 


0 

99 

207 

3 

6 

15 

135 

3 

2 

100 

33 

1 

1 

lb* 

216 

3 

4 

101 

41 

2 

-1 



Figure A-2. Data as Entered by Test 


no exact definition exists. A better unit might be a part (e.g., a resistor or a capacitor), a device 
whose function is irreparably destroyed by disassembly. The large number of parts per compo- 
nent essentially obviates the possibility of having more failures than parts and eliminates the need 
for the adjustment in count. The adjusted component count is shown as 1 98. 

The data itself is stored, test by test, in an array of 450 rows by 3 columns. Only rows 
having data in them are presented, and the output splits the data in half listing both halves simul- 
taneously; therefore we have row 1 alongside row 86, there being 169 rows of data. 

The data can be interpreted as follows: 

Row 1 : A test lasting 109 hours and then terminating (code 3) with the 4 components being 

taken off test. 
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Row 2: A test luting 98 hours and then terminating with 1 component being taken off tut. 
Row 3: A test lasting 48 hours and then terminating with 2 components being taken off test. 
Row 4: A test interrupted (code 2) at 182 tours. No failures occurred before the interruption 
and so no repairs or tote estrtes are involved; #COMP ■ 0. 

Row S: The test in which the interruption in row 4 occurred is terminated at 296 tours and 
the 4 components that were on test are removed. 

Row 6: A test in which a failure (code 1) occurs at 192 hours. One failure is noted under 

#COMP. 

Row 7: Thw tut is interrupted (code 2) at 212 hours and previous failures are repaired. The -1 
designation is a late entry of the repaired component. The tim spent during the inter- 
ruption is subtracted from the in-test time. 

Row 8: A failure occurs as soon as the test is begun again. 

Row 9: The test continuu until time 230 when it is terminated and the remaining operating 
item, 4, is taken off tut. 

It can be seen that the code 3 designators separate individual tests. 

After ail the data from a group has been entered into the array, the array is sorted in order 
of increasing time (column 1) and the data can be output as shown in Figure A-3. The informa- 
tion in rows 2 and 1 1 in Figure A-2 now appear in rows 102 and 86, respectively, in Figure A-3. 
Some, such as row 89 in Figure A-2 cannot be exactly identified ?nce both rows 94 and 95 m 
Figure A-3 contain the same data. This is of no consequence ana simply indicates that some 
other test (not shown) contained data having the same values. 

Figure A-4 shows the statistical analysis of the data of the fust case. Column designations 
are compatible with the terminology of this report. Note that the analysis is terminated when 
fewer than 20 components are on test. This was done because of the rapid increase in variance. 
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ROW HOUR CODE #COMP ROW HOUR CODE #COMP 
1 0 1 1 86 34 1 1 

22 1 1 37 36 1 1 

3 2 2 -1 88 87 3 1 

4 2 1 1 39 39 3 1 

5 3 1 1 90 98 3 2 

6 3 1 1 91 91 2-1 

7 4 1 1 92 92 3 1 

3 3 1 1 93 92 3 2 

9 5 1 1 94 93 3 1 

10 3 2 -1 95 93 3 1 

11 7 1 1 96 94 3 1 

12 8 1 1 97 94 3 1 

13 12 3 1 98 94 3 1 

14 13 3 1 99 95 3 1 

15 14 3 1 100 95 3 1 

16 17 3 2 101 37 3 1 



Figure A-3. Sorted Data 
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Figure A-4. Analysis of First Case Data 


A- 5 



1 

The data for the second case, records 941 to 1631, were operated on the same way as that 
of the first case and the statistical analysis is ,hown in Figure A-S. 

Finally, the two groups of data are compared. A value of Z = 1 .28 was selected and the 
comparison can be seen in Figure A-6. 

From Figure A-6, we can see that the null hypothesis (the two groups are alike) can be 
rejected in the period 0 to 24 hours. With the acceptance region of 0.9S81 to 3.6S80 then, there 
exists a 1 6.9 percent probability that the observed value will fall in the non-rejection region when 
the alternative hypothesis is true. 

Figure A-7 graphically compares the two groups of components. While differences are 
apparent, the analysis as in Figure A-6 is needed to attribute confidence interval statements as to 
the degree of similarity. 
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Figure A-5. Analysis of Second Case Data 
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Figure A-6. 

Comparison of the Two Groups of Data 
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